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O ' Abstract 

' We consider the problem of performing a random walk in a distributed network. Given 

lO . bandwidth constraints, the goal of the problem is to minimize the number of rounds required to 

obtain a random walk sample. Das Sarma et al. [PODC'IO] show that a random walk of length 
£ on a network of diameter D can be performed in 0(\/ZD + D) time. A major question left 
' open is whether there exists a faster algorithm, especially whether the multiplication of ^fi and 

y/D is necessary. 

In this paper, we show a tight unconditional lower bound on the time complexity of dis- 
tributed random walk computation. Specifically, we show that for any n, D, and D < i < 
{n/(D^ log n))^/"*, performing a random walk of length on an n-node network of diameter D 
requires + D) time. This bound is unconditional^ i.e., it holds for any (possibly random- 

ized) algorithm. To the best of our knowledge, this is the first lower bound that the diameter 
\^ . plays a role of multiplicative factor. Our bound shows that the algorithm of Das Sarma et al. is 

' time optimal. 

0^ . Our proof technique introduces a new connection between bounded-round communication 

complexity and distributed algorithm lower bounds with _D as a trade-off parameter, strength- 
CsJ , ening the previous study by Das Sarma et al. [STOC'll]. In particular, we make use of the 

' bounded-round communication complexity of the pointer chasing problem. Our technique can 

be of independent interest and may be useful in showing non-trivial lower bounds on the com- 
plexity of other fundamental distributed computing problems. 



Keywords: Random walks. Distributed algorithms. Lower bounds. Communication complexity. 



1 Introduction 



The random walk plays a central role in computer science, spanning a wide range of areas in both 
theory and practice. The focus of this paper is on performing a random walk in distributed networks, 
in particular, decentralized algorithms for performing a random walk in arbitrary networks. The 
random walk is used as an integral subroutine in a wide variety of network applications ranging 
from token management [19^ HJ [8] , small- world routing [21] , search [311 [H \7\ [TBI [27] , information 
propagation and gathering OHO], network topology construction [T8l[2ll[25], expander testing [12], 
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part by NSF grant CCF-1023166 and by a grant from the United States-Israel Binational Science Foundation (BSF). 
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constructing random spanning trees O [3l [2] , distributed construction of expander networks [21] , 
and peer-to-peer membership management [161 I32j . For more applications of random walks to 
distributed networks, see e.g. [IT]. Motivated by the wide applicability of the random walk, [10} [TT] 
consider the running time of performing a random walk on the synchronous distributed model. We 
now explain the model and problems before describing previous work and our results. 

1.1 Distributed Computing Model 

Consider a synchronous network of processors with unbounded computational power. The network 
is modeled by an undirected connected n-node multi-graph, where nodes model the processors and 
edges model the links between the processors. The processors (henceforth, nodes) communicate by 
exchanging messages via the links (henceforth, edges). The nodes have limited global knowledge, in 
particular, each of them has its own local perspective of the network (a.k.a graph), which is confined 
to its immediate neighborhood. 

There are several measures to analyze the performance of algorithms on this model, a fun- 
damental one being the running time, defined as the worst-case number of rounds of distributed 
communication. This measure naturally gives rise to a complexity measure of problems, called the 
time complexity. On each round at most O(logn) bits can be sent through each edge in each di- 
rection. This is a standard model of distributed computation known as the COMQSST model [29] 
and has been attracting a lot of research attention during last two decades (e.g., see [29j and the 
references therein). We note that our result also holds on the CONQ£S'T{B) model, where on 
each round at most B bits can be sent through each edge in each direction (see the remark after 
Theorem II. ip . We ignore this parameter to make the proofs and theorem statements simpler. 

1.2 Problems 

The basic problem is computing a random walk where destination outputs source, defined as follows. 
We are given a network G = {V, E) and a source node s ^ V . The goal is to devise a distributed 
algorithm such that, in the end, some node v outputs the ID of s, where v is a destination node 
picked according to the probability that it is the destination of a random walk of length i starting 
at s. We assume the standard random walk where, in each step, an edge is taken from the current 
node V with probability proportional to l/d{v) where d{v) is the degree of v. Our goal is to output 
a true random sample from the ^-walk distribution starting from s. 

For clarity, observe that the following naive algorithm solves the above problem in 0{tj rounds. 
The walk of length H. is performed by sending a token for ^ steps, picking a random neighbor in 
each step. Then, the destination node v of this walk outputs the ID of s. The main objective 
of distributed random walk problem is to perform such sampling with significantly less number of 
rounds, i.e., in time that is sublinear in I. On the other hand, we note that it can take too much time 
(as much as 0(|-E| + D) time) in the COMQEST model to collect all the topological information at 
the source node (and then computing the walk locally). 

The following variations were also previously considered. 

1. Computing a random walk where source outputs destination: The problem is almost the same 
as above except that, in the end, the source has to output the ID of the destination. This 
version is useful in nodes learning the topology of their surrounding networks and related 
applications such as a decentralized algorithm for estimating the mixing time 

2. Computing a random walk where nodes know their positions: Instead of outputting the ID of 
source or destination, we want each node to know its position(s) in the random walk. That 
is, if vi,V2, ■■■,ve (where vi = s) is the result random walk starting at s, we want each node Vj 
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in the walk to know the number j at the end of the process. This version is used to construct 
a random spanning tree in 

1.3 Previous work and our result 

A key purpose of the random walk in many network applications is to perform node sampling. While 
the sampling requirements in different applications vary, whenever a true sample is required from 
a random walk of certain steps, typically all applications perform the walk naively — by simply 
passing a token from one node to its neighbor: thus to perform a random walk of length i takes 
time linear in i. Das Sarma et al. [10] showed this is not a time-optimal strategy and the running 
time can be made sublinear in i, i.e., performing a random walk of length I on an n-node network 
of diameter D can be done in 0(£^/^D^/^) time where O hides polylogn. Subsequently, Das Sarma 
et al. [11] improved this bound to 0{y/iD + D) which holds for all three versions of the problem. 

There are two key motivations for obtaining sublinear time bounds. The first is that in many 
algorithmic applications, walks of length significantly greater than the network diameter are needed. 
For example, this is necessary in two applications presented in |llj . namely distributed computation 
of a random spanning tree (RST) and computation of mixing time. More generally, many real- 
world communication networks (e.g., ad hoc networks and peer-to-peer networks) have relatively 
small diameter, and random walks of length at least the diameter are usually performed for many 
sampling applications, i.e., i » D. 

The second motivation is understanding the time complexity of distributed random walks. Ran- 
dom walk is essentially a global problem which requires the algorithm to "traverse" the entire 
network. Classical "global" problems include the minimum spanning tree, shortest path etc. Net- 
work diameter is an inherent lower bound for such problems. Problems of this type raise the basic 
question whether n (or i as the case here) time is essential or is the network diameter D, the inherent 
parameter. As pointed out in the seminal work of [T7j, in the latter case, it would be desirable to 
design algorithms that have a better complexity for graphs with low diameter. While both upper 
and lower bounds of time complexity of many "global" problems are known (see, e.g., [9]), the status 
of the random walk problem is still wide open. 

A preliminary attempt to show a random walk lower bound is presented in ^llj . They consider 
a restricted class of algorithms, where each message sent between nodes must be in the form of 
an interval of numbers. Moreover, a node is allowed to send a number or an interval containing it 
only after it receives such number. For this very restricted class of algorithms, a lower bound of 
n{V£ + D) is shown [TT] for the version where every node must know their positions in the end of 
the computation. While this lower bound shows a potential limitation of random walk algorithms, 
it has many weaknesses. First, it does not employ an information theoretic argument and thus does 
not hold for all types of algorithm. Instead, it assumes that the algorithms must send messages 
as intervals and thus holds only for a small class of algorithms, which does not even cover all 
deterministic algorithms. Second, the lower bound holds only for the version where nodes know 
their position(s), thereby leaving lower bounds for the other two random walk versions completely 
open. More importantly, there is still a gap of ^/D between lower and upper bounds, leaving a 
question whether there is a faster algorithm. 

Motivated by these applications, past results, and open problems, we consider the problem of 
finding lower bounds for random walk computation. In this work, we show an unconditional lower 
bound of VL{y/ ID + D) for all three versions of the random walk computation problem. This means 
that the algorithm in [TT] is optimal for all three variations. In particular, we show the following 
theorem. 

Theorem 1.1. For any n, D and I such that D < £ < (n/(D^ log n))-*^/^, there exists a family of 
n-node networks of diameter D such that performing a random walk (any of the three versions) of 
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length @{£) on these networks requires Q{^/iD + D) rounds. 

We note that our lower bound of i}{\/£D + D) also holds for the general COMQ£ST{B) model, 
where each edge has bandwidth B instead of O(logn), as long as ^ < {n/{D^B))^/^. Moreover, one 
can also show a lower bound on simple graphs by subdividing edges in the network used in the proof 
and double the value of i. 

1.4 Techniques and proof overview 

Our main approach relies on enhancing the connection between communication complexity and dis- 
tributed algorithm lower bounds first studied in [9]. It has been shown in [9j that a fast distributed 
algorithm for computing a function can be converted into a two-party communication protocol with 
small message complexity to compute the same function. In other words, the communication com- 
plexity lower bounds implies the time complexity lower bounds of distributed algorithms. This result 
is then used to prove lower bounds on many verification problems. (In the verification problems, 
we are given H, a subgraph of the network G, where each vertex of G knows which edges incident 
on it are in H. We would like to verify whether H has some properties, e.g., if it is a tree or if it 
is connected.) The lower bounds of verification problems are then used to prove lower bounds of 
approximation algorithms for many graph problems. Their work, however, does not make progress 
on achieving any unconditional lower bound on the random walk problem. 

Further, while this approach has been successfully used to show lower bounds for several problems 
in terms of network size (i.e., n), it is not clear how to apply them to random walk computation. 
All the lower bounds previously shown are for optimization problems for well-defined metrics - for 
e.g. computing a minimum spanning tree. Random walk computation, on the other hand, is not 
deterministic; the input requires parameters such as the length of the walk i and even if the source 
node and i are fixed, the solution (i.e. the walk) is not uniquely determined. While even other 
problems, such as MST, can have multiple solutions, for optimization problems, verification is well- 
defined. It is not clear what it even means to verify whether a random walk is correct. For this 
reason, proving a lower bound of random walk computation through verification problems seems 
impossible. 

Additionally, in terms of the theoretical bound we obtain, a key difficulty in our result is to 
introduce the graph parameter diameter (i.e., D) into the lower bound multiplied by L A crucial 
shortcoming in extending previous work in this regard is that the relationship between communica- 
tion complexity and distributed computing shown in [9j does not depend on the network diameter D 
at all. In fact, such a relationship might not exist since the result in [9] is tight for some functions. 

To overcome these obstacles, we consider a variation of communication complexity called r -round 
two-party communication complexity, which has been successfully used in, e.g., circuit complexity 
and data stream computation (see, e.g., |28] ) . We obtain a new connection showing that a fast 
distributed algorithm for computing a function can be converted to a two-party communication 
protocol with a small message complexity and number of rounds to compute the same function. 
Moreover, the larger the network diameter is, the smaller the number of rounds will be. To obtain 
this result one need to deal with a more involved proof; for example, the new proof does not seem 
to work for the networks previously considered [9l [HI [221 [Ml [30] and thus we need to introduce a 
new network called G{T,k,A) (which is essentially an extension of the network in [30]). This 
result and related definitions are stated and proved in Section [2l 

A particular communication complexity result that we will use is that of Nisan and Wigder- 
son [28] for the r-round pointer chasing problem. Using the connection established in Section [2l we 
derive a lower bound of any distributed algorithms for solving the pointer chasing problem on a 
distributed network. This result is in Section [3l 
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Finally, we prove Theorem 11.11 from the lower bound result in Section [3j The main idea, which 
was also used in |11] . is to construct a network that has the same structure as ^(r, k, A) (thus has 
the same diameter and number of nodes) but different edge capacities (depending on the input) so 
that a random walk follows a desired path (which is unknown) with high probability. This proof is 
in Section HI 

2 From bounded-round communication complexity to distributed 
algorithm lower bounds 

Consider the following problem. There are two parties that have unbounded computational power. 
Each party receives a 6-bit string, for some integer 6 > 1, denoted by x and y in {0, 1}^. They both 
want to together compute f{x,y) for some function / : {0, 1}* x {0, 1}'' — >• M. At the end of the 
computation, the party receiving y has to output the value of f{x,y). We consider two models of 
communication. 

• r -round direct communication: This is a variant of the standard model in communication 
complexity (see [28j and references therein). Two parties can communicate via a bidirectional 
edge of unlimited bandwidth. We call the party receiving x Alice, and the other party Bob. 
Two parties communicate in rounds where each round Alice sends a message (of any size) to 
Bob followed by Bob sending a message to Alice. 

• Communication through network G{T,k,A): Two parties are distinct nodes in a distributed 
network G{T,k,A), for some integers T and A and real At; all networks in G{T,k,A) have 
Q{k.TA'^) nodes and a diameter of 0(kA). (This network is described below.) We denote the 
nodes receiving x and y hy s and t, respectively. 

We consider the public coin randomized algorithms under both models. In particular, we assume 
that all parties (Alice and Bob in the first model and all nodes in G{T, k, A) in the second model) 
share a random bit string of infinite length. For any e > 0, we say that a randomized algorithm 
A is e-error if for any input, it outputs the correct answer with probability at least 1 — e, where 
the probability is over all possible random bit strings. In the first model, we focus on the message 
complexity, i.e., the total number of bits exchanged between Alice and Bob, denoted by Re^^'^~^^^ (f) . 
In the second model, we focus on the running time, denoted by ^^('"''''^)'*'*(j). 

Before we describe G(r, k. A) in detail, we note the following characteristics which will be used 
in later sections. An essential part of G(r,K,A) consists of F paths, denoted by V^,. . . and 
nodes s and t (see Fig. [2]) . Every edge induced by this subgraph has infinitely many copies (in other 
words, infinite capacity). (We let some edges to have infinitely many copies so that we will have 
a freedom to specify the number of copies later on when we prove Theorem 11.11 in Section [H The 
leftmost and rightmost nodes of each path are adjacent to s and t respectively. Ending nodes on the 
same side of the path (i.e., leftmost or rightmost nodes) are adjacent to each other. The following 
properties of G(F,k, A) follow from the construction of G(F,k, A) described in Section [2.2[ 

Lemma 2.1. For any F > 1, k > 1 and A>2, network G(F,k,A) has 0(FkA'^) nodes. Each of 
its path "P* has Q{kA'^) nodes. Its diameter is Q{kA). 

Proof. It follows from the construction of G(F, n. A) in Section [22] that the number of nodes in each 
path is 4''j = Q{kA'^) (cf. Eq. Q). Since there are F paths, the number of nodes in 

all paths is ©(FkA''). Each highway W has 2[«:]A' + 1 nodes. Therefore, there are X]|=i(2[k] A* + l) 
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nodes in the highways. For A > 2, the last quantity is AI-'^J). Hence, the total number of 

nodes is Q{rKA'^). 

To analyze the diameter of G{T, k, A), observe that each node on any path V" can reach a node 
in highway Ti^'^^ by traveling through 0{kA) nodes in P*. Moreover, any node in highway Ti^ can 
reach a node in highway by traveling trough 0(A) nodes in Finally, there are 0{kA) nodes 
in Ti^. Therefore, every node can reach any other node in 0{kA) steps by traveling through Ti^. 
Note that this upper bound is tight since the distance between s and t is Q{kA). □ 

The rest of this section is devoted to prove Theorem 12.31 which strengthens Theorem 3.1 in [S]. 
Recall that Theorem 3.1 in [9] states that if there is a fast e-error algorithm for computing function 
/ on any network G(r, /t, A), then there is a fast e-error algorithm for Alice and Bob to compute /, 
as follow^ 

Theorem 2.2 (Theorem 3.1 in [9J). Consider any integers P > 1, A > 2, real k > 1 and function 
f : {0, 1}^ X {0, 1}^ — ^ M. Let r = Por any b, if r < kA'^ then f can be computed 

by a direct communication protocol using at most (2k log n)r communication bits in total. In other 
words, 

R^-^^-P^\f) < (2Klogn)i?f(^''^'^)'^'*(/). 

The theorem above does not try to optimize number of rounds used by direct communication pro- 
tocols. In fact, a closer look into the proof of Theorem 3. 1 in [9] reveals that e((2K log n)R?^^''''^^'''\f)) 
rounds of communication are used. 

Theorem 12.31 stated below strengthens the above theorem by making sure that the number of 
rounds needed in the direct communication is small. In particular, it says that if there is a fast 
e-error algorithm for computing function / on any network G(r, k, A), then there is a fast bounded- 
round e-error algorithm for Alice and Bob to compute /. More importantly, the number of rounds 
depends on the diameter of G{T,k,A) (which is 0(kA)), i.e., the larger the network diameter, the 
smaller the number of rounds. 

Theorem 2.3. Consider any integers F > 1, A > 2, real k> 1 and function f : {0, l}''x{0, 1}'' — t- M. 
Let r = /^^('"''^'^)'*'*(j)_ Por any b, if r < kA" then f can be computed by a ^-round direct 
communication protocol using at most (2Klogn)r communication bits in total. In other words, 

R7^- ^^-^"^(/)<(2.1ogn)i?f(r''^'^)'^'*(/). 

2.1 Preliminary: the network F(r, /t. A) 

Before we describe the construction of G(F, k. A), we first describe a network called -F(r, k. A) which 
is a slight modification of the network introduced in [30j. In the next section, we show how we 
modify F(F,k,A) to obtain G{T,k,K). 

G(r, K, A) has three parameters, a real k > 1 and two integers F > 1 and A > 20 The two basic 
units in the construction of -F(F, k. A) are highways and paths. 

^Note that Theorem 3.1 in [9] is in fact stated on a graph different from G(r, A) but its proof can be easily 
adapted to prove Theorem 12.21 

^Note that we could restrict k to be an integer here since FiV, k, A) = F(r, A) for any A, F, k and k' such that 
[kJ = [k'J . However, we will need k to be a real when we define G{V, k. A) so we allow it to be a real here as well to 
avoid confusion. 
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Figure 1: An example of _F(r, k, A) where A = 2 and 2 < k < 3. 



Highways. There are [kJ highways, denoted by H^, TH?^ . . ., ^L«J. The highway TV" is a path of 
2[k] A* + 1 nodes, i.e., 

y{"l~L ) = {^0' ^±AW-i' ^±2AW-i' ^±3AW-" ■ ■ ■ ' ^irKlA'AL-^J-i} 

Em = {(/iL(,-+i)AW-„/iVAW-.), (/^i-AW-.^^a+DAW-O \0<j< MA^ . 

We connect the highways by adding edges between nodes of the same subscripts, i.e., for any 
< i < [kJ and — [k]A* < j < [«^1A*, there is an edge between and 
For any j ^ 0, let 

4>j = 1 if J = 0, and = A otherwise. (1) 

We use to specify the number of nodes in the paths (defined next), i.e., each path will have 
Z^y^r taIkI 'P'i nodes. Note that 

J — — I K I AL J J 

MaL^J 

(j)'. = (2[k]AL^J + 1)A = e(KAL«J+i). (2) 



Paths. There are F paths, denoted by V^jV"^, . . . ,V^. To construct each path, we first construct 
its subpaths as follows. For each node h]-'^^ in T^L^J g^nd < i < F, we create a subpath of "P*, 
denoted by Pj, having nodes. Denote nodes in in order by Vj i,Vj 2, ■ ■ ■ , v^j ■ connect these 
paths together to form Vj, i.e., for any j > 0, we create edges (v* ^, , VJ_^_l ^) and {v^_j ^, , v^_^j_^^-^ j). 
Let 

^-oo = ^LmaW,0'_^^^^^„j and vl, = v\^^j,,^,^^,^^^^^^. 

These two nodes can be thought of as the leftmost and rightmost nodes of path V". We connect the 
paths together by adding edges between the leftmost (rightmost, respectively) nodes in the paths, 
i.e., for any i and i' , we add edges (fLoo)'^-oo) ((^^)^^)' respectively). 

K I 

We connect the highways and paths by adding an edge from each node /ij to Vj ^ . We also 
create nodes s and t and connect s {t, respectively) to all nodes v^_^ (?;^, respectively). See Fig. [1] 
for an example. 
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Figure 2: An example of G{T, k, A) where k — 2.5 and A = 2. The dashed edges (in red) have one copy while other 
edges have infinitely many copies. Note that (j}'iQ = 4 and thus there are 4 nodes in each subpath Vlo, for all i. Note 
also that (f/'io is less than ^lo which is 6. 



2.2 Description of G(r, k, A) 

We now modify F{T,k,A) to obtain G{T,k,,A). Again, G{T,k,A) has three parameters, a real 
K > 1 and two integers F > 1 and A > 2. The two basic units in the construction of G(F, k, A) 
are highways and paths. The highways are defined in exactly the same way as before. The main 
modification is the definition of (p' (cf. Eq. ([1])) which affects the number of nodes in the subpaths 
Vj of each path V\ 

Definition of (j)'. First, for a technical reason in the proof of Theorem 12 .lit we need cp'^ to be small 
when \j\ is small. Thus, we define the following notation (p. For any j, define 

_K_ +1 

Note that (pj can be viewed as the number of nodes in T-Li with subscripts between and j, i.e., 

^ f|{4 I 0<j'<i}| ifj>0 

' \|{4li</<0}| ifj<o. 

We now define (p' as follows. For any j > 0, let 

(p'j = (p'_j = min < 0j,max(l, [[k]A''] — ^ cpj' 
( j'>j 

The reason we define (p' this way is that we use it to specify the number of nodes in the paths 
(as described in the previous section) and we want to be able to control this number precisely. In 
particular, while each path in -F(F, k, A) has Q(kA^'^^^^) nodes (cf. Eq. ([2])), the number of nodes 
in each path in G(F, k, A) is 

<p'j = e{^A^). (3) 

j=-[K]AL«J 

We need this precision so that we can deal with any value of i when we prove Theorem 11.11 in 
Section |H 
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Finally, we make infinite copies of every edge except highway edges, i.e., those in L)\^^E{T-l^). (In 
other words, we make them have infinite capacity). As mentioned earlier, we do this so that we will 
have a freedom to specify the number of copies later on when we prove Theorem 11.11 in Section SI 
Observe that if Theorem 12.31 then it also holds when we set the numbers of edge copies in G(T, k, A) 
to some specific numbers. Fig. [2] shows an example of G{T,k,A). 

2.3 Terminologies 

For any numbers i, j, i' , and j', we say that > (hj) if > i or {i' = i and j' > j). For any 

— [k]AL'^J < "i < [k]AI-'^-I and 1 < j < 4>i, define the {i,j)-set as 

^ ^ Uhf \l<x<K, i' <i}U {vf,-, I 1 < X < r, > U{s} if i > 

''^ ~ \{hf \ I < X < K, i' >i}U {vf, -, I 1 < X < r, > U {r} if i < . 

See Fig. [3] for an example. For convenience, for any i > 0, let 

'S'j.o = 'S'i-i,</,'._^ and S-i^ = S-{i-i),4>'_^^_^^ > 

and, for any j, let 

5'r«;]AW+i,j = 'S'rKlAW,^' and S_^^-^^i^i^-^j = S_^^-^^i.i^^, " 

Let A be any deterministic distributed algorithm run on G{T, k, A) for computing a function /. 
Fix any input strings x and y given to s and t respectively. Let ip_/[{x,y) denote the execution of 
^ on X and y. Denote the state of the node v at the end of time r during the execution Lpj({x,y) 
by a_A{v,T,x,y). Let a^{v,0, x,y) be the state of the node v before the execution <^^(x,y) begins. 
Note that a_A{v,0,x,y) is independent of the input if f ^ {•s,t}, depends only on x if u = s and 
depends only on y ii v = t. Moreover, in two different executions Lpj\^{x,y) and ip_4^{x' ,y'), a node 
reaches the same state at time r (i.e., a_A{v,T, x,y) = aj^{v,T,x',y')) if and only if it receives the 
same sequence of messages on each of its incoming links. 

For a given set of nodes U = {vi, . . . , v^} , a, configuration 

CA{U,T,x,y) =< aA{vi,T,x,y),...,aA{ve,T,x,y) > 

is a vector of the states of the nodes of U at the end of time r of the execution ipj\^{x,y). From 
now on, to simplify notations, when A, x and y are clear from the context, we use Cfj to denote 
CA{Si,j,T,x,y). 

2.4 Proof of Theorem [23] 

Let G = G(r,K, A). Let / be the function in the theorem statement. Let Ae be any e-error 
distributed algorithm for computing / on G. Fix a random string f used by Ae (shared by all nodes 
in G) and consider the deterministic algorithm A run on the input of Ae and the fixed random 
string f. Let be the worst case running time of algorithm A (over all inputs). We only consider 
T4 < kA'^, as assumed in the theorem statement. We show that Alice and Bob, when given f as the 
public random string, can simulate A using (2Klogn)T4 communication bits in 8Ta/{kA) rounds, 
as follows. (We provide an example in the end of this section.) 
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Figure 3: An example of round 11 in the proof of Theorem 12.31 (see detail in Example 12. 6p . 



Rounds, Phases, and Iterations. For convenience, we will name the rounds backward, i.e., 
Alice and Bob start at round [k]AL''J and proceed to round [k]AI-'^-I — 1, [k]AL''J — 2, and so on. 
Each round is divided into two phases, i.e., when Alice sends messages and Bob sends messages 
(recall that Alice sends messages first in each iteration). Each phase of round r is divided into (j)',. 
iterations. Each iteration simulates one round of algorithm A. We call the i*'* iteration of round r 
when Alice (Bob, respectively) sends messages the iteration Ir,A,i iIr,B,i, respectively). Therefore, 
in each round r we have the following order of iterations: Ir,A,i, Ir,A,2, ■ ■ Ir,A,<j>'^-, Ir,B,i, ■ ■ ■, Ir,B,<f>'^- 
For convenience, we refer to the time before communication begins as round [k]AL''J + 1 and let 

Ir,Afi = 4+l,A,(/);^i and Ir,Bfi = ^r+l,B,(f>'^_^_j^- 

Our goal is to simulate one round of algorithm A per iteration. That is, after iteration Ir,B,i 
finishes, we will finish the 0^/ + «)*^ round of algorithm A. Specifically, we let 

r'=r+l 

and our goal is to construct a protocol with properties as in the following lemma. 

Lemma 2.4. There exists a protocol such that there are at most Klogn bits sent in each iteration 
and satisfies the following properties. For any r > and < i < 

1. after Ir,A,i finishes, Alice and Bob know C*''^^^^j_i ^ and C^Z^^i _j) respectively, and 

2. after Ir,B,i finishes, Alice and Bob know C*''^*_j and y, respectively. 

Proof. We first argue that the properties hold for iteration /|-k]aL«J+i ac before Alice and Bob 
starts communicating. After round r = [k]AI-'^J starts, Alice can compute C^^-^ q = C^^^ i = C^^, 
which contains the states of all nodes in G(T, k, A) except t. She can do this because every node 
except s and t has the same state regardless of the input and the state of s depends only on her 
input string x. Similarly, Bob can compute C^^j-^.^-^^ q = C^^j-^,_^-^^ ^ = C^^, which depends only on 
his input y. 

Now we show that, if the lemma holds for any iteration Ir^A,i-i then it also holds for iteration 
Ir,A,i as well. Specifically, we show that if Alice and Bob know C^^[I^^Zi^\iKi-i i and C^_Z^^7^_f^-_-^y 

respectively, then they will know C*'^'^^^^j„i ^ and C^^^^, _., respectively, after Alice sends at most 
K log n messages. 
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First we show that Ahce can compute C'*'^^[^„j_i ^ without receiving any message from Bob. 
Recall that Alice can compute i if she knows 

• C*'"^iTii 1 1 , and 

• all messages sent to all nodes in Sj._^j^ik]-i i at time tr + i of algorithm A. 

By assumption, Alice knows C^'I^-IJ^^lkj-i ^ which imphes that she knows C'^^'^^jij-i i since 

'S'r-iAW-1,1 ^ 'S'r-(i-l)AW-i,i ■ 

Moreover, observe that all neighbors of all nodes in Sj._^^ik]-i i are in -S^.^j.j^^y^^L-^J-i i- Thus, Alice 
can compute all messages sent to all nodes in S'j._j^[Kj-i ^ at time tr + i of algorithm A. Therefore, 
Alice can compute C*^^[kJ-i i without receiving any message from Bob. 

Now we show that Bob can compute C*^"^!, _ . by receiving at most k log n bits from Alice and 
use the knowledge of C'^^^r^_j^^. Note that Bob can compute C^Z^'^i _j if knows 

• C*'-+fr^ ., and 

• all messages sent to all nodes in S_^^^f _j at time tr + i oi algorithm A. 

By assumption. Bob knows C^Z^^^_^_^^ which implies that he knows C!r,^J,r^_,- since S_^^^' _j C 
S-r,(l>' -i+i- Moreover, observe that all neighbors of all nodes in _j are in except 

, [kJ , [kJ — 1 L L'^J — * 1,1 

"-(r+l)''^-(Lr/AJ+l)' • • • ''^-([r/AiJ+1)' • • • ' "-([r/A W -iJ+1) • 

In other words. Bob can compute all messages sent to all nodes in S_r,(j)'_ -i at time tr + i except 

y.l^_(r+iy'^-r)^---^^^^ i"_(l_r/A<J+l)'"-Lr/A'J'''---'^'^ i'i_(Lr/A W -iJ+1) ' /^-([r/A W "ij ^ 

where M*'"*'*(u, is the message sent from « to f at time t,. + i of algorithm A. Observe further 
that Alice can compute these messages because she knows C'*1^*ZJ)^l„j-i ^ which contains the states 
of 

"-(r+l)' • • • ' "-([r/AiJ+l)' • • • ' "-(Lr/AW-iJ+l) 

at time tr + i — f- (In particular, ^ is a superset of Cq'i^~^ which contains the states 

of h}-^^^_^iy ■ ■ ., ^^Qr/AL«J-ij+i)') ^li'^'^ ^an send these messages to Bob and Bob can compute 
^~r~(t)' -i '^^ iteration. Each of these messages contains at most log n bits since each 

of them corresponds to a message sent on one edge. Therefore, Alice sends at most k log n bits to 
Bob in total. This shows the first property. 

After Alice finishes sending messages, the two parties will switch their roles and a similar protocol 
can be used to show that the second property, i.e., if the lemma holds for any iteration Ir,B,i-i 
then it also holds for iteration Ir,B,i as well. That is, if Alice and Bob know C*'^*r(j_i) and 
^i'^r+(iii)AL'«J-i 1' I'espectively, then Bob can send Atlogn bits to Alice so that they can compute 
and C':;;.^,^,_,^^, respectively. □ 
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Let P be the protocol as in Lemma [2.4[ Alice and Bob will run protocol P until round r', where 
i the lar^ 
Bob knows 



r' is the largest number such that t,.' + > T4. Lemma [2.41 implies that after iteration Ir' ,B,Tj^-t^i 



C 



(note that 4>'_j., — Tj\^ + i^' > 0). In particular, Bob knows the state of node t at time i.e., he 
knows C7^(t, T4, X, y). Thus, Bob can output the output of A which is output from t. 

Since is e-error, the probability (over all possible shared random strings) that A outputs the 
correct value of f{x,y) is at least 1 — e. Therefore, the communication protocol run by Alice and 
Bob is e-error as well. The number of rounds is bounded as in the following claim. 

Claim 2.5. If algorithm A finishes in time T4 < [k]A'' then r' > [k]A'' — 8T4/([k]A). In other 
words, the number of rounds Alice and Bob need to simulate A is 8r4/([K;]A) 

Proof Let R* = 8r4/([K] A) and let r* = A^'^J - R* + 1. Assume for the sake of contradiction that 
Alice and Bob need more than R* rounds. This means that r' < r*. Alice and Bob requiring more 
than R* rounds implies that 

^ = t,. + < MA«. (4) 



It follows that for any r > r*, 



6^ = min I (/>(/if ), max(l, [MA'"] - ^ cpr') | 

\ r'>r / 



(5) 



AN-i 



(6) 

+ 1 (7) 



where Eq. ([5]) follows from the definition of 0^, Eq. ([6]) is because ^^>r* 0^ < ['^1^'*) and Eq. ([7]) 
is by the definition of ^r- Therefore, the total number of steps that can be simulated by Alice and 
Bob up to round r* is 

Mal^J mal^J 

r=r* r=r* 

I,, 1 L^VAL'^J-^J(MA-i) 



> aL«J- 

2 

^ i?*MA 
- 8 

contradicting Eq. (jl]). □ 

Since there are at most wlogn bits sent in each iteration and Alice and Bob runs P for Tj^ 
iterations, the total number of bits exchanged is at most (2Klogn)r4. This completes the proof of 
Theorem 12.31 
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Example 2.6. Fig.[3]shows an example of the protocol we use above. Before iteration /ii,a,i begins, 
Alice and Bob know Cj^ ^ and C'Ln^, respectively (since Alice and Bob already simulated A for 
'^12 ~ steps in round 12). Then, Alice computes and sends M^(/i^j^2; ^-ii) -^^(^l_i2) ^-lo) 
to Bob. Alice and Bob then compute Cf^ ^ and g, respectively, at the end of iteration Iii^a,i- 
After they repeat this process for five more times, i.e. Alice sends 

M\hl,„hl,,),M'^{hl,„hl,,),...,M'^{hl,„hl,,), and 

Bob will be able to compute q = Ci^^^g 4- Note that Alice is able to compute C| ^, Cj^, . . ., C\\ 
without receiving any messages from Bob so she can compute and send the previously mentioned 
messages to Bob. 



3 The pointer chasing problem 

In this section, we define the pointer chasing problem and prove its lower bound (Lemma l3.2p which 
will be used to prove Theorem 11.11 in the next section. 

Informally, the r-round pointer chasing problem has parameters r and m and there are two 
players, which could be Alice and Bob or nodes s and t, who receive functions Ja ■ [itt] [m] and 
/b '■ [m] — )• [m], respectively. The goal is to compute a function starting from 1 and alternatively 
applying Ja and Jb for r times each, i.e., compute . . f'AifBifA))) where /a and Jb appear r 
times each. To be precise, let Tm be the set of functions / : [m] — )• [m]. For any i > define 
5* : X J^m — ^ ["m] inductively as 



5°(/a,/b) = 1 and 



fAig' ^Ua, fs)) if i > and i is odd, 
fsig^'^ifA, Ib)) if i > and i is even. 



Also define function pc*'"^(/a, /s) = g^^ {f A-, f b) ■ The goal of the r-round pointer chasing problem 
is to compute PC'''™(/yi, /b). 

Observe that if Alice and Bob can communicate for r rounds then they can compute pc'''™ 
naively by exchanging 0(r log m) bits. Interestingly, Nisan and Wigderson [28] show that if Alice 
and Bob are allowed only r — 1 rounds then they essentially cannot do anything better than having 
Alice sent everything she knows to BobH 

Theorem 3.1. i^J^g ^^"''^"^"^(PC^''^) = ^(m/r^ - rlogm). 

The pointer chasing problem on G(r, At, A). We now consider the pointer chasing problem on 
network G{T, k, A) where s and t receive /a and /b respectively. The following lemma follows from 
Theorem [23] and ED 

Lemma 3.2. For any k,T, A>2, m> K^A^'^logn, IGA"'^ >r> 8A''-\ ''''^^''''^(pc''''") = 
J7(kA'^). 



^In fact this holds even when Ahce and Bob are allowed r rounds but Alice cannot send a message in the first 
round. 
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Proof. Let r = i?^fi"''^'^^'*'*(pc'^''"). If r > kA'^ then we are done so we assume that r < kA**. Thus, 



n/3 

g^G(r,f=,A),s,t^p(2,.,mj 



-cc—pub. 



R,,^ ---(per,™) 

- > ^^^^ ^TTT— 3 (8) 



(2k; log n) 



8KA" 



-cc—pub , 



> ^ 1 fg) 

(2k log n) ^ ^ 

^ ^ (m(8A'--i)-^-8A"-Mogm) ^ ^^^^ 

(Klogn) 

= 17(kA'') (11) 

where Eq. ([8]) is by Theorem 12.31 and the fact that r < kA'^, Eq. ([9]) uses the fact that the commu- 
nication does not increase when we allow more rounds and i?f/„^'''''^^'''*(pc^'™) < kK"", Eq. ([TO]) 



-1/3 

follows from Theorem 13.11 with the fact that IGA*^^^ > r > 8A''~^ and Eq. (jlip is because 
m > K^A'^'' logn. □ 

4 Proof of the main theorem 

In this section, we prove Theorem II. li An ^{D) lower bound has already been shown (and is fairly 
straightforward) in [10]; so we focus on showing the J7(\/ZD) lower bound. Moreover, we will prove 
the theorem only for the version where destination outputs source. This is because we can convert 
algorithms for the other two version to solve this version by adding 0{D) rounds. To see this, 
observe that once the source outputs the ID of the destination, we can take additional 0{D) rounds 
to send the ID of the source to the destination. Similarly, if nodes know their positions, the node 
with position i can output the source's ID by taking additional 0{D) rounds to request for the 
source's ID. Theorem I l.H for the case where destination outputs source, follows from the following 
lemma. 

Lemma 4.1. For any real k > 1 and integers A > 2, and T > 32k^A^''~^ logn, there exists a family 
of networks % such that any network H has Q{kTA'^) nodes and diameter D = Q{kA), and any 
algorithm for computing the destination of a random walk of length £ = 0(A^''~^) requires CI{^/ID) 
time on some network H £ Ti. 

Proof. We show how to compute pc'"'™' on G = G{T, k, A) by reducing the problem to the problem 
of sampling a random walk destination in some network Hf^^jg, obtained by restrict the number 
of copies of some edges in G, depending on input functions and fs- We let T-l be the family of 
network Hf^ j^ over all input functions. Note that for any input functions, an algorithm on Hf^ j^ 
can be run on G with the same running time since every edge in G has more capacity than its 
counterpart in Hj^^jg. 

Let r = IGA*^"-*^ and m = K^A^'^logn. Note that 2rm < T. For any i <r and j < m, let 

— p2{i-l)m+j g^^^ rpij _ p2{i-l)m+m+j ^ 
That is, S^'^ =V^,.. ., 5!'™ = V"", T^'l = P'^+l, . . ., Tl''" = V"^"", S^-^ = p2m+l^ _ _ _^ r^r,m ^ p2rm_ 

Let L be the number of nodes in each path. Note that L = Q{kA'^) by Lemma l2.ll Denote the 
nodes in S^'^ from left to right by s\\...,s^^. (Thus, s\^ = v^^^^^^'^'^^ and s^f = i)™+i_) 
Also denote the nodes in T*'-' from right to left by t^'-' , . . . , t^-' . (Thus, t\^ = v'^ i-)m+m+j 
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t^^ = v'^l^ i)m+m+j -J Note that for any i and j, and t^j^ are adjacent to s while s^^ and are 
adjacent to t. 

Now we construct Hf^jg. For simplicity, we fix input functions /a and fs and denote Hf^jg 
simply by H. To get H we let every edge in G have one copy (thus with capacity O(logn)), except 
the following edges. For any i < r, j < m, and x < L, we have (GTl)'^^^^^^^'^^ copies of edges 
between nodes s^'-' and and (6r£)^(*~^)^+-^"'"^ copies of edges between nodes t^-' and i^':^;^. Note 
that these numbers of copies of edges are always the same, regardless of the input /a and fs- 

Additionally, we have the following numbers of edges which depend on the input functions. 
First, s specifies the following number of edges between its neighbors. For any i < r, j < m, we 
have (6r^)^(*^^^^+^ copies of edges between nodes t^^^ and s\'^'^^''\ These numbers of edges can be 
specified in one round since both s^'-' and t\^^^^^ are adjacent to s. Similarly, we have (6r£)^(*~^)^"''^^ 

copies of edges between nodes and g"^^'^^^^^ which can be done in one round since both nodes 
are adjacent to t. This completes the description of H. 

Now we use any random walk algorithm to compute the destination of a walk of length (. = 
2rL — 1 = 0(A^''^^) on H by starting a random walk at s\''^^^\ If the random walk destination is 
t2"' for some j, then node t outputs the number j; otherwise, node t outputs an arbitrary number. 

Now observe the following claim. 

Claim 4.2. Node t outputs pc^'™(/a5/b) with probability at least 2/3. 

Proof. Let P* be the path consisting of nodes sY^^^\ sY^''^\ tY''^^^''^^\ t]^^^^^^^^^\ 

follow path P* with probability at least 2/3. The node of distance (2rL — 1) from s\''^^^^^ in this 
path is ^fj^'fs) — ^2^*^ '•^^ and thus the algorithm described above will output PC^'™'(1) with 
probability at least 2/3. 

To prove the above claim, consider any node u in path P* . Let u' and u" be the node before 
and after u in P* , respectively. Let m' and m" be the number of multiedges {u,u') and {u,u"), 
respectively. Observe that m" > 6T£m'. Moreover, observe that there are at most F edges between 
u and other nodes. Thus, if a random walk is at u, it will continue to u" with probability at least 
1 — ^. By union bound, the probability that a random walk will follow P* is at least 1 — 5, as 
claimed. □ 

Thus, if there is any random walk algorithm with running time 0{T) on all networks in Ti then 
we can use such algorithm to solve pc*"'™ (with error probability 1/3) in time 0{T). Using the lower 
bound of computing solving pc^'™ in Lemma 13.21 the random walk computation also has a lower 
bound of Q{kA'^) = ^}{VID) as claimed. □ 

To prove Theorem 11.11 with the given parameters n, D and i, we simply set A and k so that 
kA = D and A^'^~^ = Q{i). This choice of A and k exists since i > D. Setting F large enough so 
that F > 32fi;2A6K-i bgn while F = @{n). (This choice of F exists since £ < {n/{D^logn)y/^.) By 
applying the above lemma. Theorem 11.11 follows. 



5 Conclusion 

In this paper we prove a tight unconditional lower bound on the time complexity of distributed 
random walk computation, implying that the algorithm in [11] is time optimal. To the best of our 
knowledge, this is the first lower bound that the diameter plays a role of multiplicative factor. Our 
proof technique comes from strengthening the connection between communication complexity and 
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distributed algorithm lower bounds initially studied in [9] by associating rounds in communication 
complexity to the distributed algorithm running time, with network diameter as a trade-off factor. 

There are many open problems left for random walk computation. One interesting open problem 
is showing a lower bound of performing a long walk. We conjecture that the same lower bound of 
fi{-\/£D) holds for any i = 0{n). However, it is not clear whether this will hold for longer walks. 
For example, one can generate a random spanning tree by computing a walk of length equals the 
cover time (using the version where every node knows their positions) which is 0{mD) where m is 
the number of edges in the network (see detail in pLlj). It is interesting to see if performing such a 
walk can be done faster. Additionally, the upper and lower bounds of the problem of generating a 
random spanning tree itself is very interesting since its current upper bound of 0{y/rnD) [TT] simply 
follows as an application of random walk computation [llj while no lower bound is known. Another 
interesting open problem prove the lower bound of ^{\/K£D) for some value of £ for the problem 
of performing K walks of length i. 

In light of the success in proving distributed algorithm lower bounds from communication com- 
plexity in this and the previous work [9j, it is also interesting to explore further applications of 
this technique. One interesting approach is to show a connection between distributed algorithm 
lower bounds and other models of communication complexity, such as multiparty and asymmetric 
communication complexity (see, e.g., |23l). One particular interesting research topic is applying this 
technique to distance-related problems such as shortest s-t path, single-source distance computation, 
and all-pairs shortest path. The lower bound of Q{^/n) are shown in [9] for these types of problems. 
It is interesting to see if there is an 0{^/n)-time algorithm for these problems (or any sub- linear time 
algorithm) or a time lower bound of ^J{^/n) exists. The special cases of these problems on complete 
graphs (as noted in [13]) are particularly interesting. Besides these problems, there are still some 
gaps between upper and lower bounds of problems considered in [9] such as the minimum cut and 
generalized Steiner forest. 
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